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Abstract.  This  project  aims  at  extending  Constraint  Programming  (CP) 
with  four  promising  ideas:  linear  programming,  tlow  algoritlniLs.  random¬ 
ization  and  symmetry  breaking.  The  work  has  been  conducted  by  ILOG 
in  collaboration  with  I1SI  at  Cornell  University. 


1  Introduction 

We  have  focused  otir  attention  on  Cardinality  Matrix  Problems  (CMP). 
Cardinality  Matrix  Problems  are  expressed  by  a  matrix  where  each  row 
and  each  column  are  constrained  by  cardinality  constraints,  that  is  by 
constraints  that  define  the  number  of  times  each  value  in  a  row  or  in 
a  column  has  to  be  assigned  to  variables.  A  typical  example  of  CMP  is 
a  rostering  problems,  and  a  lot  of  real  world  problems  can  be  modelled 
as  a  CMP.  We  decided  to  mainly  consider  CMPs  because  it  lias  already 
been  proved  that  CP  is  certainly  one  of  the  best  techniques  to  solve  these 
problems.  Sports  scheduling  problems  ure  good  examples  of  this  claim, 
for  instance  the  National  Football  League  in  the  US  recently  adopted 
ILOG  OPL  and  ILOG  Solver  to  compute  the  schedule  of  the  season. 
The  IIS  institute  at  Cornell  has  been  working  for  several  years  on  a 
restricted  form  of  the  CMP:  the  alldtiT  matrix  problem.  In  this  case,  each 
value  has  to  be  assigned  at  most  once  in  every  row  and  every  column.  The 
alldifT  matrix  is  the  underlying  structure  of  several  real  world  problems, 
such  as  design  of  scientific  experiments  or  fiber  optics  routing.  Thus,  we 
decided  first  to  study  this  restriction  of  CMPs,  because  we  know  that 
most  of  the  results  that  we  obtain  for  alldiff  matrix  problem  could  be 
adapted  to  CMPs. 

We  obtained  two  types  of  results:  a  new  constraint  using  OR  algorithms 
and  dedicated  to  alldiff  matrix  problems  and  an  original  and  efficient 
met  hod  to  break  the  symmetries  in  that  kind  of  problems.  These  results 
are  presented  in  the  next,  section. 
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2  Results 

2.1  AlldiffMatrix  Constraint 

An  alldiff  matrix  problem  is  defined  by  a  matrix  and  imposes  an  alldiff 
constraint  on  every  row  and  every  column  of  the  matrix.  Consider  the 
following  example;  a  6x6  matrix  has  to  be  filled  with  numbers  ranging 
from  1  to  6.  A  classical  mode!  in  CP  consists  of  defining  one  variable  per 
cell,  each  variable  can  take  a  value  from  1  to  6,  and  one  alldiff  contraint 
per  row  and  one  alldiff  contraint  per  column.  Now,  consider  the  following 
situation: 
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lis  case,  the  alldiff  constraints  are  only  able  to  deduce  that: 

•  only  the  values  5  and  C  can  be  assigned  to  the  cells  (8, 3)  and  (6. 3) 

•  only  the  values  3  and  6  can  be  assigned  to  the  cells  (5,4)  and  (5,4). 


However,  with  a  careful  study  we  can  see  that  the  value  6  will  be  assigned 
either  to  (5,3)  and  (6,4)  or  to  (5,4)  and  (6.3)  this  means  that,  the  other 
columns  of  rows  5  and  6  cannot  take  these  values  and  therefore  we  can 
remove  the  value  6  from  the  domains  of  the  corresponding  variables. 
Such  a  filtering  algorithm  can  be  easily  established  thunks  to  the  Alld- 
iffMatrix  constraint.  This  constraints  proposes  to  add  a  new  alldiff  con¬ 
straint  based  on  variables  defined  on  the  symbols  that  are  used  and  to 
link  all  t  he  variables  of  all  the  alldiff  constraints. 

Moreover,  we  can  identify  t.he  filtering  algorithm  associated  with  this 
constraint.  We  can  prove  the  following  proposition: 


Proposition  1  Consider  an  alldiff  matrix  problem  Then,  an  alldiff  con 
straint  defined  on  .symbol  k  ensures  that  for  every  p  x  7  rectangles,  denoted 
by  T,  tuc  have: 
m(k)  tP  +  <l~n, 

where  m(k)  denotes  the  number  of  times  the  symbol  k  appears  in  T. 


This  means  that  the  AlldiffMatrix  constraint  is  able  to  ensure  with  a 
polynomial  algorithm  a  property  that  have  to  be  satisfied  by  an  expo¬ 
nential  number  of  rectangles. 


2.2  Symmetry  breaking 

Symmetries  in  search  problems  can  increase  the  size  of  the  search  space 
dramatically.  Breaking  symmetries  is  therefore  highly  desirable.  Symme¬ 
try  breaking  strategies  have  been  proposed  for  mat  rix  models,  consider¬ 
ing  the  permutation  of  rows  and  columns.  We  propose  a  representation 
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for  matrix  models  that  emphasizes  a  third  level  of  symmetry,  in  addi¬ 
tion  to  the  permutation  of  row’s  and  columns  —  permutation  of  symbols. 
Furthermore,  our  representation  scheme  also  allows  us  to  identify1  sym¬ 
metries  at  a  higher  level  oj  granularity,  namely  by  exchanging  portions  of 
rows,  columns,  or  symbols  This  feature  is  particular  useful  for  problems 
that  have  additional  constraints  per  row,  column,  or  symbol  that  prevent 
the  permutation  of  the  complete  row  (column,  symbol),  such  as  models 
with  pre- assigned  values  in  some  matrix  cells.  We  show  that  exchang¬ 
ing  portions  of  (or  entire)  rows,  columns,  or  symbols  in  au  alldiff  matrix 
model  corresponds  to  toggling  the  edge  labels  of  a  bipartite  graph,  which 
allows  us  to  efficiently  remove  symmetries  in  an  alldiff  matrix  model. 
The  main  advantage  of  our  approach  in  regards  to  the  previous  studies 
is  that  for  the  first  time  we  are  able  to  break  some  symmetries  whereas 
some  variables  are  preassigned 

The  results  we  obtained  are  fully  detailled  in  the  extended  abstract  which 
is  joined  with  this  report. 


3  Visit  to  Cornell 

Jean-Char  les  Flog  in  (II.  OG  Director  of  Constraint  Programming)  will 
visit  IISI  at  Cornell  from  March  27  to  April  3  200-1. 
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Abstract.  Cardinality  matrix  problems  are  the  underlying  structure 
of  several  real  world  problems  such  as  rostering,  sports  scheduling  ,  and 
timetabling.  These  are  hard  computational  problems  given  their  inherent 
combinatorial  structure.  Constraint  based  approaches  have  been  shown 
to  outperform  other  approaches  for  solving  these  problems  In  this  pa¬ 
per  we  propose  the  cardinality  matrix  constraint,  a  specialized  global 
constraint  for  cardinality  matrix  problems.  The  cardinality  matrix  con¬ 
straint  takes  advantage  of  the  intrinsic  structure  of  the  cardinality  matrix 
problems.  It  uses  a  global  cardinality  constraint  per  row  and  per  column 
and  one  cardinality  (0,l)-matrbc  constraint  per  symbol.  This  latter  con¬ 
straint.  corresponds  to  solving  a  special  case  of  a  network  flow  problem, 
the  transportation  problem,  which  effectively  captures  the  interactions 
between  rows,  columns,  and  symbols  of  cardinality  matrix  problems.  Our 
results  show  that  the  cardinality  matrix  constraint  outperforms  standard 
constraint  based  formulations  of  cardinality  matrix  problems. 


1  Introduction 

In  recent  years  Constraint  Programming  (CP)  techniques  have  been  shown  to 
effectively  solve  hard  combinatorial  problems.  In  fact,  constraint  based  methods 
excel  at  solving  problems  that  are  inherently  combinatorial,  dearly  outperform¬ 
ing  traditional  Operations  Research  (OR}  techniques.  Sports  scheduling  and  ros¬ 
tering  problems  are  good  examples  of  highly  combinatorial  problems,  for  which 
CP  based  techniques  have  been  shown  to  be  very  successful  (see  e.g.,  [11,5]). 

In  a  rostering  problem,  given  a  set  of  workers,  a  set  of  tasks,  and  a  set 
of  days  (typically  a  week),  the  goal  is  to  assign  the  tasks  per  person  and  per 
day  satisfying  various  constraints.  Among  them,  typical  constraints  require  the 
workload  per  day  to  be  constrained  by  the  number  of  times  each  task  has  t.o  be 
performed,  and  the  schedule  of  each  person  to  be  constrained  by  the  number  of 
times  each  task  lias  to  be  performed. 

Sports  scheduling  problems  and  rostering  problems  arc  particular  cases  of 
what  we  refer  to  as  cardinality  matrix  problems.  Cardinality  matrix  problems 
are  expressed  by  a  matrix  of  variables  where  each  row  and  each  column  are 
constrained  by  cardinality  constraints,  that  is  by  constraints  that  define  the 
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number  of  times  each  value  in  a  row  or  in  a  column  has  to  be  assigned  to 
variables.  We  can  model  the  rostering  problem  as  a  cardinality  matrix  problem 
in  which  each  row  of  the  matrix  corresponds  to  a  worker  ami  each  column  of  the 
matrix  corresponds  to  a  day  of  the  week.  The  values  in  the  matrix  correspond  to 
tasks.  The  cardinality  constraints  on  the  rows  constrain  the  number  of  tasks  to 
be  assigned  per  worker  and  the  cardinality  constraints  on  the  columns  constrain 
the  number  of  task  to  be  assigned  daily. 

A  straightforward  model  for  representing  cardinality  matrix  problems,  e.g.. 
rostering  problems,  consists  of: 

•  a  matrix  of  variables,  in  which  each  variable  corresponds  to  a  cell  that 
takes  as  value  the  task  to  be  performed  on  a  given  day  by  a  given  person.  The 
variable  domains  arc  the  set  of  tasks  that  can  be  performed  by  a  given  person 
for  a  given  day. 

•  a  global  cardinality  constraint  (GCC)  for  every  row.  which  constrains  the 
number  of  times  each  task  has  to  be  performed  by  the  person  corresponding  to 
the  row. 

•  a  global  cardinality  constraint  for  every  column,  which  constrains  the  num¬ 
ber  of  times  each  task  has  to  be  performed  for  the  day  corresponding  to  the 
column. 

This  formulation  uses  several  global  constraints  and  can  give  good  results  in 
practice.  However,  it  suffers  from  some  major  drawbacks,  namely: 

1 .  There  is  poor  communication  between  the  variables  constraining  the  number 
of  times  a  value  has  to  be  taken,  called  cardinality  variables. 

2.  The  communication  between  the  rows  and  the  columns  is  poor.  In  fact,  any 
GCC  defined  on  a  row  and  any  GCC  defined  on  a  column  have  only  one 
variable  in  common.  This  means  that  we  have  an  efficient  way  to  deal  with 
all  the  variables  of  a  row  (or  a  column)  as  a  whole,  but  we  are  not  able  to 
really  deal  with  the  notion  of  a  matrix. 

3.  The  GCCs  deal  with  a  set  of  predefined  intervals  constraining  for  each  value 
the  number  of  times  the  value  has  to  be  assigned.  In  real-life  problems,  vari¬ 
ables  defining  these  intervals  are  more  often  used.  Even  if  it  is  easy  to  deduce 
intervals  from  these  variables,  because  it  corresponds  to  the  boundaries  of 
these  variables,  we  do  not  have  filtering  algorithms  to  reduce  their  ranges 
in  the  general  case  (such  a  filtering  algorithm  has  been  proposed  when  the 
domains  of  the  variables  ou  which  the  GCCs  are  defined  are  ranges  [9]). 

The  communication  between  constraints  mentioned  in  (1)  and  (2)  can  be 
improved  by  adding  implied  constraints.  An  implied  constraint  for  a  given  CSP 
is  a  constraint  that  can  be  deduced  from  the  other  constraints  of  the  CSP. 
but  which  introduces  a  filtering  algorithm  that  can  reveal  inconsistencies  which 
arc  not  discovered  by  the  combination  of  the  filtering  algorithms  of  the  other 
constraints.  So  the  introduction  of  implied  constraints  can  lead  to  a  reduction 
of  the  number  of  backtracks  needed  to  find  one  solution  or  to  prove  that  there 
is  none.  The  introduction  of  implied  constraints  can  improve  dramatically  the 
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efficiency  of  search  since  it  allows  for  the  detect  ion  of  inconsistencies  earlier  than 
it  would  be  possible  if  such  constraints  were  not  stated  explicitly  (see  o.g.,  [3]). 

ihe  limitation  stated  in  point  (2)  deserves  a  more  careful  study.  Consider  a 
restricted  form  of  the  cardinality  matrix  problems:  the  alldiff  matrix  problem [8]. 
In  this  case,  each  value  has  to  be  assigned  at  most,  once  in  each  row  and  each 
column.  The  alldiff  matrix  characterizes  the  structure  of  several  real  world  prob¬ 
lems,  such  as  design  of  scientific  experiments  or  fiber  optics  routing.  Consider 
the  following  example:  a  6x6  matrix  has  to  be  filled  with  numbers  ranging  from 
1  to  6  (this  is  a  latin  square  problem).  A  classical  model  in  Cl’  consists  of  defin¬ 
ing  one  variable  per  cell,  each  variable  can  take  a  value  from  1  to  6,  and  one 
alldiff  constraint  per  row  and  one  alldiff  constraint  per  column.  Now,  consider 
the  following  situation: 
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In  this  case,  the  alldiff  constraints  are  only  able  to  deduce  that: 

•  only  the  values  5  and  6  can  be  assigned  to  the  cells  (5,3)  and  (6,3) 

•  only  the  values  3  and  0  can  be  assigned  to  the  cells  (5. 4)  and  (6, 4). 

However,  with  a  careful  study  we  can  see  that  the  value  6  will  be  assigned 
either  to  (5, 3)  and  (6,4)  or  to  (5, 4)  and  (6, 3)  this  means  that  the  other  columns 
of  rows  5  and  6  cannot  take  these  values  and  therefore  wre  can  remove  the  value  0 
from  the  domains  of  the  corresponding  variables  (the  ones  with  a  •  in  the  figure). 
We  will  show  how  our  approach,  using  what  we  refer  to  as  the  cardinality  (0,1)- 
matrix,  automatically  performs  these  inferences. 

One  of  the  key  successful  approaches  in  CP  has  been  the  identification  of  typ¬ 
ical  constraints  that  arise  in  several  real-world  problems  and  associate  with  them 
very  specialized  and  efficient,  filtering  algorithms,  so-called  global  constraints.  In 
recent  years  several  global  constraints  have  been  proposed  and  shown  to  boost 
dramatically  the  performance  of  CP  based  techniques. 

We  propose  the  cardinality  matrix  constraint  to  capture  the  structure  of 
cardinality  matrix  problems  such  as  the  rostering  problem.  A  cardinality  matrix 
constraint  (cardMatrix)  C  is  specified  in  terms  of  an  n  x  m  matrix  A  f  of  variables 
which  lake  their  values  from  a  set  of  s  symbols,  and  two  sets  (rowCard  and 
colCard)  of  cardinality  variables  that  specify  the  number  of  limes  each  symbol 
has  to  appear  in  a  row  (rowCard)  and  the  number  of  times  each  symbol  has  to 
appear  in  a  column  (colCard).  More  specifically,  the  set  of  cardinality  variables 
rowCard  constrains  the  number  of  variables  of  a  row  i  of  M  instantiated  to 
a  symbol  p  to  be  equal  to  rowCard[i,  p]  and  the  set  of  cardinality  variables 
colCard  constrains  the  number  of  variables  of  a  column  j  of  A/  instantiated  to  a 
symbol  <}  to  be  equal  to  colCard\j,  c/J.  In  order  to  take  advantage  of  the  structure 
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underlying  the  cardinality  matrix  constraint  we  introduce  a  constraint  named 
cardinality  (0,l}-matrix.  The  cardinality  (0,l)-matrix  is  a  p;trticular  case  of  a 
network  How  problem,  the  transportation  problem.  This  constraint  effectively 
captures  the  interactions  between  rows,  columns,  and  symbols  in  a  cardinality 
matrix  problem.  We  also  develop  a  simple  filtering  algorithm  for  the  cardinality 
matrix  constraint  with  a  low  complexity  that,  enables  ns  to  reduce  the  ranges  of 
the  cardinality  variables.  As  we  show  in  our  experimental  section,  we  obtain  very 
promising  results  which  allow  us  to  solve  problems  that  could  not  be  solved  before 
with  constraint,  programming  techniques.  We  also  compare  the  performance  of 
our  approach  against  standard  formulations  of  a  cardinality  matrix  problems. 
We  obtain  dramatic  speed  ups  with  our  approach. 

The  rest  of  the  paper  is  organized  as  follows:  In  the  next  section  wo  define  our 
notation  and  present,  definitions  concerning  constraint  programming  and  graph 
theory.  We  then  roughly  present  the  cardinality  matrix  cojistraint  and  propose 
a  simple  filtering  algorithm  for  reducing  the  ranges  of  cardinality  variables  of  a 
GCC.  Next,  we  introduce  the  Cardinality  (0,1)-Macrix  Constraint,  followed  by 
the  description  of  a  filtering  algorithm  for  the  Cardinality  Matrix  Constraint. 
We  present,  experimental  results  in  section  7,  followed  by  conclusions. 

2  Preliminaries 

T>o  =  {Do(x\),  ...  Do(zn)}  to  represent  the  set  of  initial  domains  of  .V.  Indeed, 
we  consider  that  any  constraint  network  .V  can  be  associated  with  an  initial 
domain  Do  (containing  £>),  on  which  constraint  definitions  were  stated. 

A  constraint  C  on  the  ordered  set  of  variables  X(C)  =  (x* . x,-„)  is  a 

subset  T(C)  of  the  Cartesian  product  Do(zt, )  x  x  Dn(x,r)  that  specifies 
the  allowed  combinations  of  values  for  the  variables  xu...txr.  An  element  of 
j  \ )  x  x  Do(xr)  is  called  a  tuple  on  A'fC").  r[x]  denotes  the  value  of  x  in 
the  tuple  t 

Let  C  be  a  constraint.  A  tuple  r  on  X {C)  is  valid  if  Vx  e  X (C),  r[x]  €  D(x).  C 
is  consistent  iff  there  exists  a  tuple  r  of  T(C)  which  is  valid.  A  value  a  6  D(x) 
is  consistent  with  C  iff  x  {7  X(C)  or  there  exists  a  valid  tuple  r  of  T(C) 
with  a  =  r[x].  A  constraint  is  arc  consistent  iff  Vx,  6  X(C),D(x,)  gt  0  and 
Vo  e  D(Xi),  a  is  consistent  with  C. 

The  value  graph  of  a  set  of  variables  A'  is  the  bipartite  graph  GV(X)  = 
(XyUZi£xD(Xi),  E)  where  (x.a)  e  E  iff  a  e  D{x). 

We  recall  the  formal  definition  of  a  global  cardinality  constraint: 

Definition  1  A  global  cardinality  constraint  C  defined  on  X  and  associated 
with  a  set  of  values  V  with  D( X)  CV  is  a  constraint  in  which  each  value  u,  £  V 
is  associated  with  two  positive  integers  /,  and  ti,  with  l,  <  u,  and 
T(C)  =  {  r  s.t.  r  is  a  tuple  on  X (C) 

andVai  €  V  :  U  <  #(ai,r)  <  u.) 

It  is  denoted  by  ycc(X,  V.l.u)- 

Note  that  an  alldiff  constraint  can  be  defined  by  a  GCC  in  which  all  lower  bound 
are  equals  to  0  and  ail  upper  bounds  are  equal  to  1 . 


o 


o 


An  instantiation  of  all  variables  that  satisfies  all  the  constraints  is  called  a 
solution  of  a  CN.  Constraint  Programming  (CP)  proposes  to  search  for  a  solu¬ 
tion  by  associating  with  each  constraint  a  filtering  algorithm  that  removes  some 
values  of  variables  that  cannot  belong  to  any  solution.  These  filtering  algorithms 
are  repeatedly  called  until  no  new  deduction  can  be  made.  Then,  CP  uses  a 
search  procedure  (like  a  backtracking  algorithm)  where  filtering  algorithms  arc 
systematically  applied  when  the  domain  of  a  variable  is  modified. 


2.1  Graph  Theory 

These  definitions  are  based  on  books  of  [2, 16,  lj. 

A  directed  graph  or  digraph  G  -  (X,  U)  consists  of  a  node  set  X  and 
an  arc  set  U,  where  every  arc  (u,jj)  is  an  ordered  pair  of  distinct  nodes.  We 
will  denote  by  X{G)  the  node  set  of  G  and  by  U(G)  the  arc  set  of  G. 

A  path  from  node  t'i  to  node  ti*  in  G  is  a  list  of  nodes  [uj, ....  Wfc]  such  that 
(tij,  e,-+t )  is  an  arc  for  i  £  [1  ..k  —  1).  The  path  contains  node  t,  for  i  £  [l-.Jk]  and 
arc  (u,,tri_j-x)  for  i  €  [l..fc  —  1].  The  path  is  simple  if  all  its  nodes  are  distinct. 
The  path  is  a  cycle  if  k  >  1  and  Vi  =  u*.  An  undirected  graph  is  connected  if 
there  is  a  path  between  every  pair  of  nodes.  The  maximal  connected  subgraplis 
of  G  are  its  connected  components.  A  directed  graph  is  strongly  connected 
if  there  is  a  path  between  every  pair  of  nodes.  The  maximal  strongly  connected 
subgraphs  of  G  are  its  strongly  connected  components.  A  bridge  is  an  edge 
whose  removal  increases  the  number  of  connected  components. 

Let  G  be  a  graph  for  which  each  arc  (i,j)  is  associated  with  two  integers  /y 
and  iiy,  respectively  called  the  lower  bound  capacity  and  the  upper  bound 
capacity  of  the  arc.  A  How  in  G  is  a  function  /  satisfying  the  following  two 
conditions1  : 

•  For  any  arc  (i,j),  fij  represents  the  amount  of  some  commodity  that  can 
'‘flow”  through  the  arc.  Such  a  flow  is  permitted  only  in  the  indicated  direction 
of  the  arc,  i.e.,  from  i  to  j.  For  convenience,  we  assume  fi}  —  0  if  (i,j)  f.  f7(G) 

•  A  conservation  law  is  observed  at  each  node:  V)  G  X (G)  :  Ei  ft]  ~  E*  />*• 

A  feasible  flow  is  a  flow  in  G  that  satisfies  the  capacity  constraint,  that 
is,  such  that  V(t,j)  £  U(G)  ltJ  <  /y  <  u,;. 

Definition  2  The  residual  graph  for  a  given  flow  f,  denoted  by  R(f).  is  the 
digraph  with  the  same  node  set  as  in  G.  The  are  set  ofR(f)  is  defined  as  follows: 
V(t.j)  £  U(G): 

•  fi)  <  n,j  «  (i,j)  6  U(R(f))  and  upper  bound  rapacity  ry  =  uy  -  fi  j. 

•  fij  >  Lj  0,  i)  £  U (11(f  ))  and  upper  bound  capacity  r;i  =  fij  -  li}. 

All  the  lower  bound  capacities  are  equal  to  0. 


Without  loss  of  generality  (see  p.45  and  p.297  in  [1]),  and  to  overcome  notation 
difficulties,  we  will  consider  that  if  (i,  j)  is  an  arc  of  G  then  (j,  i)  is  not  an  arc  of  (», 
and  that  all  boundaries  of  capacities  are  nonnegative  integers. 
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2.2  Notation 

•  rnax(x)  (rcsp.  min(x))  denotes  the  maximum  (resp.  minimum)  value  of  D(x). 

•  D(X )  denotes  the  union  of  domaius  of  variables  of  X  (i.e.  D{X)  =  U-lfe.v  D(z:,)). 

•  #(<i,r)  is  the  number  of  occurrences  of  the  value  a  in  the  tuple  r 

•  #(a,  A')  is  the  number  of  variables  of  X  such  that  a  G  D(x). 

•  Row(M)  (resp.  Col(M))  is  the  set  of  indices  of  the  rows  (resp  columns)  of 
the  matrix  M. 

•  If  X  is  a  n  x  m  array,  that  is  X=x[iJ]),  then  cors(i.  «,  X)  =  {x[i,  j],j  -  l..m} 
and  vars(*,j,X)  =  {*[<■, j],i  =  l.-.n}. 


3  Cardinality  Matrix  Constraint:  Presentation 

Definition  3  A  cardinality  matrix  constraint  is  a  constraint  C  defined  on  u 
Matrix  M  -  x(i,jf]  of  variable  s  taking  their  values  in  a  act  1",  and  on  two  sets 
of  cardinality  variables  rowCard[i,j]  and  colCard[i.j]  and 
T(C)  =  (  r  s.i.  r  is  u  tuple,  on  X(C) 

and  Vo*  €  l ’.Vi  €  Row(M)  :  #(a*,tmra(», »,  A/))  =  rou<Card[i.k] 
and  Vo*  E  V,Vj  G  Col{M)  :  #(a*.  vars(i,  *.  Af))  =  colCard[j,  A-] 

It  is  denoted  by  card-M atrix(M,  V,  rowCard,  cotCard). 

In  order  to  show  how  a  cardinality  matrix  constraint  is  represented  we  need 
first  to  introduce  cardinality  variables.  The  GCCs  consider  that  the  lower  and 
the  upper  bounds  are  integer.  There  is  no  problem  to  use  variables  instead  of 
integers.  In  this  case,  the  lower  bound  is  the  minimal  value  of  the  domain  of  the 
variable  and  the  upper  bound  is  the  maximal  value  of  the  domain.  We  will  call 
such  variables  cardinality  variables.  Thus,  we  can  define  a  global  cardinality 
constraint  involving  cardinality  variables  (abbreviated  cardVar-GCC): 

Definition  4  A  global  cardinality  constraint  involving  cardinality  vari¬ 
ables  defined  on  X  and  card  and  associated  with  a  set  of  values  V  with  D(X)  C 
V  is  a  constraint  C  in  which  each  value  a,-  G  T  is  associated  with  a  cardinality 
variable  card  [if  and 
T{C)  =  {  r  s.f.  r  is  a  tuple  on  X(C) 

and  Va,  £  V  :  cordjt]  =  #(<i,,  r)} 

It  is  denoted  by  gcc(X.V.card). 

We  propose  to  represent  a  cardinality  matrix  constraint  by: 

•  one  cardVar-GCC  per  row  and  one  cardVar-GCC  per  column; 

•  a  sum  constraint  involving  the  previous  cardinality  variables  stating  that 
the  number  of  symbols  taken  by  all  the  rows  (resp.  all  the  columns)  is  the  size 
of  the  matrix; 

•  one  cardinality  (O.l)-matrix  constraint  involving  cardinality  variables  per 
symbol.  Such  a  constraint  involves  boolean  variables  corresponding  t,o  the  pres¬ 
ence  or  the  absence  of  the  symbol  for  a  cell  of  the  matrix,  and  combines  the  rows 
and  the  columns  for  the  symbol. 


Thus,  with  such  a  representation  the  communication  is  improved  in  two  ways: 

•  by  the  presence  of  cardinality  variables 

•  by  the  introduction  of  a  new  constraint  combining  the  rows  and  the 
columns  for  each  symbol. 

This  communication  will  be  efficent  if  some  powerful  filtering  algorithms  are 
available  to  reduce  the  domains  of  the  cardinality  variables  and  the  domains  of 
the  boolean  variables  on  which  cardinality  (O.l)-matrix  constraints  are  defined. 
This  is  what  we  study  in  the  next  sections. 


4  Filtering  Algorithm  for  costVar-GCC 

A  GCC  C  is  consistent  iff  there  is  a  flow  in  the  the  value  network  of  C  [12].  The 
consistency  of  gcc(X,  V,  card)  Ls  equivalent  to  the  consistency  of  the  constraint 
gcc{  A",  V,  l,  u)  where  for  every  a,  €  V  /[«]  =  rnin(c«r(i[t])  and  u[i]  =  mux(card[i]). 
When  the  minimum  or  the  maximum  value  of  the  domain  of  a  cardinality  variable 
is  modified  then  C  is  modified  and  so  the  consistency  of  the  constraint  must  be 
established  again.  Since  the  flow  algorithms  are  incremental,  a  new  feasible  flow 
can  be  computed  in  O(m).  where  m  is  the  number  of  arcs  of  the  network. 

Arc  consistency  for  cardVar-GCC  can  be  established  for  the  variables  of  A' 
by  using  the  method  of  GCCs,  because  the  problem  remains  the  same  for  these 
variables.  For  the  cardinality  variables  we  are  more  interested  in  the  validity 
of  the  minimum  and  the  maximum  value  of  the  domains.  Bound  consistency 
can  be  established  by  searching  for  the  minimum  and  the  maximum  value  such 
that  a  feasible  flow  exists.  However,  the  cost  of  this  method  is  high  and  its 
practical  advantage  has  not  been  proved  in  general-  Therefore,  wo  propose  a 
simple  filtering  algorithm  whose  cost  is  low  and  wliich  is  worthwliile  in  practice: 

Property  1  Lei  C  —  gee(X,  V’,  card)  be  a  cardVar-GCC.  Then,  we  have: 

•  G  V  cord[fj  <  #(<!,,  X) 

*  Eoi€VCttrdl4  =  lA'l 

The  second  point  is  a  classical  sum  constraint  and  bound  consistency  can  be 
established  in  0(|V|).  Then,  we  immediately  have  the  property: 

Property  2  Let  C  =  gcc{X,  V,  card)  be  a  cardVar-GCC,  GV(X)  be  the  value 
graph  of  X  Then  for  every  connected  component  CC  of  GV(X)  we  have: 
E0let.ai.(c;c;»c«r4'l  =  |t nrs(CC)\, 

where  vals(CC)  denotes  the  values  of  V  belonging  to  CC  and  vars  denotes  the 
variables  of  X  belonging  to  CC. 

proof:  AH  the  connected  components  are  disjoint  by  definition,  thus  the  problem  Is 
equivalent  to  a  disjunction  of  GCCs,  each  of  them  corresponding  to  a  connected  com¬ 
ponent.  Then,  Property  1  can  be  independently  applied  on  each  GCC  © 


o 
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The  filtering  algorithm  associated  with  cardinality  variables  is  defined  by 
Property  1  and  by  Property  2.  Tts  complexity  is  in  OflVJ)  for  all  the  sum  con¬ 
straints  that  can  be  defined  and  0(m)  for  the  search  for  connected  components, 
where  m  is  the  number  of  edges  of  the  value  graph  of  X  [15]. 

At  first  glance,  Property  2  seems  weak,  but  in  fact  this  is  not  true,  as  shown 
by  the  following  property: 

Property  3  Let  C  =  gcd(X,V,card)  he  a  card Var-GCC,  c.y  be  a  cardinality 
variable,  and  k  be  an  integer.  If  eg  =  k  in  every  solution  of  C  then  the  domain 
of  eg  is  set  to  k  after  establishing  arc  consistency  of  the  A'  variables  and  after 
establishing  bound  consistency  of  sum  constraints  defined  by  Property  2 

In  order  to  prove  this  property  we  need  first  to  introduce  a  theorem  which  is  a 
generalization  of  a  property  used  to  establish  arc  consistency  for  a  GCC,  because 
it  deals  with  any  kind  of  lower  and  upper  bound  capacities,  and  not  only  (0,1), 

Theorem  1  Let  f  be  a  feasible  flow  in  N .  and  (x,a)  be  on  arc  of  N .  Then, 
for  every  feasible  flow  f  in  N;  /*»  =  f'za  if  and  only  if  one  of  the  following 
property  is  satisfied: 

(0  (*,«)**(/)  and  («,*)*/*(/) 

(ii)  R(f)  contains  (x, «)  or  (a.  x)  but  not  both  and  x  and  a  belong  to  two 
different  strongly  connected  components  of  R(f) 

(Hi)  (x.a)  £  R(f)  and(a,X )  C  R(f)  and  (x.a)  is  a  bridge  of  ud(scc(R(f),x)), 
where  ud(scc(R(f),  x) )  is  the  undirected  version  of  the  strongly  connected  com¬ 
ponent  of  R(f)  containing  x. 

proof:  (»)  From  definition  of  R(f),  this  means  that  l(x.a)  —  u(a,x),  so  the  flow  value 
cannot  be  changed  in  any  feasible  flow 
(ii)  The  flow  theory  claims  that: 

•  the  flow'  value  of  (x.a)  can  be  increased  if  and  only  if  (x.a)  €  /?(/)  and  there 
is  path  from  a  to  r  in  /?(/)  -  ((a,x)},  that  Is  in  R(f)  in  this  case  because  we  have 

<*,o)e  *(/)=>  («,*)  i  R(f). 

•  the  flow'  value  of  (x,  a)  can  be  decreased  if  and  only  if  (a,  x)  e  R(J)  and  there 
is  path  from  x  to  a  in  R(f)  —  {(/,#)],  that  is  in  R(f)  in  this  case  because  we  have 
(«,*)£  *(/)=*(*, a)(R(f). 

So  in  this  case,  a  flow  value  is  constant  if  and  only  if  u  and  x  belong  to  two  different 
strongly  connected  components. 

(tit)  We  will  call  non  trivial  (u,  v)  cycle,  a  directed  cycle  which  contains  (u,  v)  but  not 
(v.  u).  There  are  two  possibilities: 

1)  there  is  a  non  trivial  (x,  a)  cycle  or  a  non  trivia!  (a,  x)  cycle  This  means  that 
the  flow'  can  be  increased  or  decreased,  therefore  it  has  not  the  same  value  for  every 
feasible  flow.  Moreover,  there  exists  a  directed  cycle  which  is  non  trivial,  so  this  cycle  is 
also  a  cycle  in  the  undirected  version  and  the  arc  (x.a)  is  not  a  bridge  and  conversely. 

2)  there  does  not  exist  a  non  trivial  (x,  a)  cycle  and  there  does  not  exist  a  non 
trivial  (a,  x)  cycle  Let  X(x)  be  the  set  of  nodes  of  scc(R(f)  —  (o},x),  and  .Y(«)  be  the 
set  of  nodes  of  $cc(R(f)  —  (x},n).  Then  Vp  e  ,Y(x),  p  J-  x  and  e  ,Y(a),  7  /  a,  we 
can  prove  that  the  arcs  (p.g),  ( q.p ),  (x.g),  (7,  x),  (a.p),  ( p.a )  do  not  exist  Suppose 
that  (p.g)  exists.  Then,  there  is  a  path  from  x  to  p  which  does  not  contain  a  and  an  arc 
(p,  7)  and  a  path  from  7  to  a  which  does  not  contain  x,  therefore  this  means  that  we 


identify  a  non  trivia)  (a.x)  cycle,  which  contradicts  the  hypothesis.  A  similar  reasoning 
is  valid  for  al)  the  arcs.  Hence,  if  (x,n)  and  (a.z)  are  removed  from  R(f)  then  x  and 
a  will  belong  to  two  different  connected  component  of  the  undirected  version  of  R(f). 
This  is  equivalent  to  saying,  that  (*,«)  is  a  bridge.© 

Now,  we  can  give  a  proof  of  Property  3:  Let  a  be  the  value  whose  cardinality 
is  cy,  and  /  be  a  feasible  flow  of  N(C),  the  value  network  of  C.  For  convenience 
we  will  use  USCC  -  .id(scc(fl(/),a)),  SCC  =  scc(R(f),a),  CC  =  cc(GV(A'),a), 
As  —  ear a(SCC),  and  Vs  —  vnls(SCC)  If  the  flow  value  of  (a.t)  is  the  same  for 
every  leasible  flow,  then  from  Theorem  1  either  a  and  t  belong  to  different  connected 
components  or  (a,  ()  Ls  a  bridge  of  USCC. 

In  the  first  case,  this  means  that  all  the  arcs  between  a  value  of  SCC  and  t  have 
the  same  direction,  In  other  words,  the  flow  value  of  these  arcs  is  cither  equal  to 
the  lower  bound  capacity  or  is  equal  to  the  upper  bound  capacity.  So  we  have  either 
I  As  I  =  5Z,-.,,;v'5  rnin(oard[i])  or  ; Xs i  =  £aj€Vj  »w*(carrf[<]).  In  both  cases  the  bound 
consistency  of  the  constraint  |.YS|  =  gv/  carcf[«J  will  instantiate  all  these  cardinality 
variables  to  the  current  flow  value  of  their  corresponding  arc 

In  the  second  case,  (a,  t)  is  a  bridge  of  USCC  and  the  value  graph  does  not  contain  t, 
so  CC  is  a  subgraph  of  USCC-  If  SCC  contains  t  and  if  (a,  f )  Ls  a  bridge  of  USCC  then 
(a.t)  and  (f,a)  exist  in  R(f)  and  there  is  no  other  arc  between  vals(CC)  and  t.  Thus, 
the  lower  and  the  upper  bound  capacities  are  equal  for  every  value  of  vals(CC)  which 
Ls  not  equal  to  a  In  this  case,  the  bound  consistency  of  the  sum  constraint  involving 
cy  will  instantiate  cy  to  the  current  flow  value  of  its  corresponding  arc.  © 


5  Cardinality  (0,1)-Matrix  Constraint 

5.1  Absence  of  Cardinality  Variables 

Definition  5  Let  M  -  x[i,  jj  be  a  matrix  of  (0,l)-variables.  .4  Cardinality 
(0,1)-Matrix  constraint  is  a  constraint  C  defined  on  M  in  which 

•  every  row  i  is  associated  with  two  positive  integers  /r[i]  and  ur[i]  with 

M‘1  <  wrl»l 

•  every  column  j  is  associated  with  two  positive  integers  lc[«j  and  uc(i]  with 
tc[i]  <  uc[i],arui 

T(C)  =  {  r  s.t.  t  is  a  tuple  on  X(C) 

and  Vi  €  Row(M) :  fr[i]  <  £>6Cd(Af)  <  ur[»l 

and  Vj  e  Col(M) :  /c[j]  <  £icRow(M)  x[i,j\  <  nc\j]} 

It  is  denoted  by  card-(0,l)-M atrix(M,  lr,  ur,  /c,  uc). 

This  constraint  corresponds  to  a  generalization  of  a  well  known  problem  named 
"Matrices  composed  of  0’s  and  l’s”  by  Ford  and  Fulkerson  [6].  In  this  latter 
problem,  there  is  no  lower  bound  for  the  rows  and  no  upper  bound  for  the 
columns.  Both  Ryser  [14]  and  Gale  [7]  independently  showed  that  this  problem 
can  be  solved  by  using  a  flow.  The  introduction  of  lower  bounds  on  rows  and 
upper  bounds  on  columns  only  slightly  modified  the  flow: 
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Definition  6  Given  M  =  x[i,j]  a  matrix  of  (0,1) -variables  and  C  =  card- 
(0,l)-Matrix(M,tr,  ur.lc.  uc)  a  cardinality  (OJ)-matrvc  constraint;  the  bipartite 
network  of  C ,  denoted  by  j V(C),  consists  of  a  node  set  defined  by: 

•  a  set  of  nodes  SR  =  {r, , ....  r„)  corresponding  to  the  rows  of  M. 

•  a  set  of  nodes  SC  =  {c],...,cm}  corresponding  to  the  columns  of  M . 

•  a  .source  node  $  and  a  sink  t 
and  an  arc  set  .4  defined  by: 

•  Vr,  €  SR.  (s.  r,)  £  .4  with  a  lower  bound  capacity  equal  to  fr[ij  and  an 
upper  bound  capacity  equal  to  nr{t’]. 

•  Vcj  g  SC  (cj,t)  €  .4  with  a  lower  bound  capacity  equal  to  lc[j]  and  an 
upper  bound  capacity  equal  to  uc(j]. 

•  Vr,  £  SR.  Vcj  €  SC  (r,,cj)  €  A  with  a  capacity  equal  to  x(»,  j],  that  is 
the  lower  bound  capacity  is  equal  to  mm{x[i,jJ)  and  the  upper  bound  capacity  is 
equal  to  mar(r(i,  j|) 

•  an  arr  (t,s)  without  capacity  constraint 

Note  that,  the  (0,1)- variables  define  the  capacity  constraints  of  the  arcs  between 
nodes  corresponding  to  rows  and  nodes  corresponding  to  columns. 

Proposition  1  C  is  consistent  if  and  only  if  there  is  a  feasible  flow  in  the 
bipartite  network  of  C . 

We  can  establish  arc  consistency  of  the  card-(0,l)-Matrix  constraint  by  a 
similar  method  to  the  one  used  for  GCCs2: 

Proposition  2  Let  C  be  a  consistent  cardinality  (O.l)-Matrix  constraint  and  } 
be  a  feasible  flow  in  the  bipartite  network  of  C.  Then  we  have: 

Vr;  6  S II.  Vc<  £  SC:  r;  and  c;  do  not  belong  to  the  same  strongly  connected 
component  in  R{f)  if  and  only  if  x[i,jj  —  /r,Cj . 

proof:  Immediate  from  Properties  (i)  and  (ii)  of  Theorem  1  (Property  (iii)  cannot  be 
applied  because  the  capacity  of  the  arcs  between  rows  and  columns  are  I)  or  1).  0 

Thus,  arc  consistency  can  be  established  by  only  one  identification  of  the 
strongly  connected  components  in  /?{/),  that  is  in  0(|A/|). 

The  advantage  of  the  cardinality  (0,1) -matrix  constraint  is  emphasized  b> 
the  following  theorem: 

Theorem  2  ConsidcrC  =  curd-(0, 1  )-Tiuitrix(M.lr,  ur.  Ic.  uc)  a  cardinality  (0.1)- 
matrix  constraint.  Establishing  arc  consistency  for  C  ensures  that  for  every  p  x  q 
rectangle,  denoted  by  T  we  simultaneously  have: 

53  *M>  53  ,rM  “  13  UCW  w 

(IJ)eT  i€  /tow(T)  jaiCoHM)-Col(T)) 

53  XM>  53  ,CM  -  53  urfr1  (2) 

(tJ)eT  iGCol(T)  je(«ow<W)-Kotii(T)) 


3  A  similar  constraint,  althrotigh  expressed  in  a  quite  different  way,  with  the  same 
kind  of  algorithm  to  establish  arc  consistency,  is  given  in  (lOj. 
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proof:  C  is  consistent.  Consider  Q  the  rectangle  containing  the  same  rows  as  T  and 
the  columns  that  are  not  contained  in  T.  Every  feasible  flow  of  N{C)  satisfied  the  con¬ 
straints  on  the  rows:  £0t>K(nwn*M  >  Ei£«ou,<n  ,r(0-  Wchave£(iJK(Tu£j)z[i,/) 

*M+E(,.J)eQ  80  E„„->,r  *M+E(.,«6«,  *M  >  E,€*,„(r,  M«l- 

Moreover  E(ij)€«  *[‘i  j\  2  Ejc(Col(M)-Cm(T))  ucUl  ,)0cau£t!  ‘he  constraints  on  the 
columns  of  Q  are  satisfied.  So,  Equation  1  is  satisfied. 

Similarly,  consider  Q  the  rectangle  containing  the  same  columns  as  T  and  the 
rows  that  are  not  contained  in  T  Every  feasible  flow  of  .V (C)  satisfies  the  constraints 
on  the  columns:  E(lj)€CruQ,*t''G  >  E.6c:<,i(7>  M»T  Wc  have  Ehjjwtuov *M  = 

*(''  -f)*80  E(ij)€'r  x[,’ii)+E(ij)eQ  *[*»  j]  -  E,6Cot<T>  M*' *■’] 

Moreover  E{<j)6Q s  Ejg(/w<.0)-K,„.  r»  «rbl.  because  the  constraints  on  the 
rows  of  Q  are  satisfied.  So,  Equation  2  is  satisfied.  0 

A  corollary  of  this  theorem  is  dose  to  a  necessary  condition  of  a  theorem 
proposed  for  latin  square  by  Ryser  [13]: 

Corollary  1  //Vi  6  Row{M)  fr[i]  =  ur[f]  =  1,  and  Vj  6  Col(M)  lc[j]  = 
nc\j]  =  1.  then  E«.,)€r  *(•>  i\  >  P~  ("  “  fl) 

Thus,  with  only  one  cardinality  (O.l)-matrix  constraint,  we  are  able  to  take 
into  account  a  property  which  is  available  for  all  p  x  q  rectangles  involved  in  the 
constraint.  Instead  of  having  an  exponential  number  of  cardinality  constraints 
(because  every  row  and  column  can  be  permuted)  we  have  only  one  cardinality 
(O.l)-matrix  constraint. 

5.2  Introduction  of  cardinality  variables 

In  a  way  similar  as  the  one  used  for  GCCs  we  propose  to  introduce  cardinality 
variables  in  Cardinality  (O.l)-Matrix  constraint. 

Definition  7  Let  M  =  x[i,  j]  be  a  matrix  of  (0,1) -variables  A  Cardinality 
(O.l)-Matrix  constraint  involving  cardinality  variables  is  a  constraint  C 
defined  on  M  and  row  Cant  and  colCard  in  which 

•  every  row  i  is  associated  with  a  cardinality  variable  rowCard[i ] 

•  every  column  j  is  associated  with  a  cardinality  variable  colCard[j\,and 
T(C)  —  {  r  s.t.  r  is  a  tuple  on  A '(C) 

and  Vi  C  Rou<(M)  :  E>€Cot(Af)  *fb J'l  =  rowCard[i] 
and  Vj  6  Col(M)  :  E,6ROa(A0  *[*. il  =  co/Card[j]} 

It  is  denoted  by  card-(0.1)-Matrix(M,  rmcCard,  colCard). 

The  consistency  of  C  -  card-(Q,\)-matrix(M,lr,ur,lc,ur. )  is  equivalent  to  the 
consistency  of  the  constraint  card-(0,l)-matrix(M,  rowCard,  colCard)  where 
Vi  €  Row(M)  fr[i]  =  tnin{r0U>Card[«])  and  ur[i]  =  max(rowCard[i\)  and 
Vj  €  Col(M)  fc[i]  =  m«n(co/Cart/[i])  and  uc[i]  =  maa(co/Card[iJ).  When  the 
minimum  or  the  maximum  value  of  the  domain  of  a  cardinality  variable  is  mod¬ 
ified  then  C  is  modified  and  so  the  consistency  of  the  constraint  must  be  estab¬ 
lished  again.  Since  the  flow  algorithms  are  incremental,  a  new  feasible  flow  can 
be  computed  in  O(m). 
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Arc  consistency  can  be  established  for  the  variables  of  M ,  because  the  prob¬ 
lem  remains  the  same  for  these  variables.  For  the  cardinality  variables  we  have 
similar  properties  as  for  cardVar-GCCs: 

Property  4  Let  C  —  gcc(X ,  i  .card)  be.  a  cardinality  (0,1  }- Matrix  constraint 
involving  cardinality  variables.  Then,  we  have: 

X<ieRou>(M)  rou>Card\i\  =  SjeCo((Af)  colCard[j] 

Bound  consistency  of  a  sum  constraint  involving  n  variables  can  be  established 
in  O(n).  As  for  cardVar-GCCs,  we  have  the  property: 

Property  5  Let  C  be  a  cardinality  (0,l)-matrix  constraint  involving  cardinality 
variables.  ud(N(C)  -  {«,  <}}  be  the  undirected  version  of  the  network  of  C  in 
which  the  node  s  and  t  have  been  removed.  Then  for  every  connected  component 
CC  of  ud(N(C)  -  {«,  t})  we  have: 

E,e/i„w(cc))r°wCar 41  =  Ljccotfcc,  colCardUl 

where  Row(CC)  denotes  the  rows  of  Af  belonging  to  CC  and  Col{CC)  denotes 
the  columns  of  M  belonging  to  CC. 

proof:  All  the  connected  components  are  disjoint  by  definition,  thus  the  problem  is 
equivalent  to  a  disjunction  of  cardinality-  (0,l)-matrix  constraint,  each  of  them  corre¬ 
sponding  to  a  connected  component.  Then,  Property  1  can  be  independently  applied 
on  each  constraint.  © 

The  filtering  algorithm  associated  with  cardinality  variables  is  defined  by 
Property  4  and  by  Property  5.  Its  complexity  is  0(|Rrw(A/)|  +  |Col(Af)|)  for  all 
the  sum  constraints  that  cart  be  defined  and  0(m )  for  the  search  for  connected 
Components,  where  m  is  the  number  of  edges  of  ud(N(C)  —  {*-,£}). 

6  Filtering  Algorithm  for  the  Cardinality  Matrix 
Constraint 

A  cardinality  matrix  constraint  is  modeled  by  a  cardVar-GCC  on  every  row, 
a  cardVar-GCC  on  every-  column,  a  constraint  between  the  sum  of  cardinality- 
variables,  and  a  cardinality  (O.l)-matrix  constraint  per  symbol: 

Definition  8  Let  C  =  card-M  atrix(M ,  V, rowCard, colCard)  be  a  cardinality 
matrix  constraint  involving  n  rows,  m  columns  and  s  symbols.  Then, 

•  for  every  row  i  we  define  Cr,  —  gcc(vars(i,  *,  Af),  V.  vars(i.  *,  rowCard)) 

•  for  every  column  j  we  define  Cc}  =  gcc(vars(*,j,  M),  V,vars(j,  *.  colCard) 

•  for  every  value  a *  6  V  we  define  the  cardinality  (0,l)-matrix 
Cm k  —  card-(0. 1  )-matrix(Bk ,  vam(* ,  k .  rowCard),  vars(*,k,  colCard)) 

•  for  every  value  at  6  V  and  for  every  variables  ar[t.  j].»  =  l..n,j  -  l..m 
we  define  the  (O.l)-variabtc  b[i,j,  A-]  and  the  constraint,  bri.j,  &]  =  1  a*  € 
D(x[».,j]).  We  will  denote  by  B/.  all  the  (0,l)-variablcs  defined  from  a*, and  by 
Cbt  the  set  of  constraints  defined  from  a*. 
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.  we  define  the  constraint s:  Crjr  :  £i=1  „  ,  rowCardfi,  k]  =  nm  and 

C9c  ■  E,=i  m  E*_i  .  colCard[j,k\  =  nm 
Given 

XQ  =  M  u  roxvCard  U  col  Card  U  ((JAt,  ,  Bk) 

T>{Xq)  the  set  of  domains  of  the  variables  Xq 

CQ=  (J  Cr,  U  |^J  Cci  U  Cbt  U  (J  Cm*  U  Cgr  U  Cgc 

i=l..n  j=  l.m  fc=l.  « 

The  constraint  network  Q  —  (Xq,'D(Xq),Cq)  is  called  the  constraint  net¬ 
work  associated  with  a  card-matrix  constraint. 

Propositions  Given  C  =  c.ar(t-mutrix(M ,  V,  rowCard,  colCard) ,  and 
H  —  (A/, D(M),  {C})  be  a  constraint  network  and  Q  the  constraint  network 
associated  with  C  then,  II  is  satisfiable  iff  Q  is  sattsfiable. 

proof:  When  the  M  variables  of  IJ  are  instantiated,  the  Cb  constraint  s  of  <3  instan¬ 
tiated  the  (0,t)-variables  of  Q  and  since  a  solution  satisfied  the  cardinality  constraint 
for  all  the  symbols  then  a  solution  of  Tl  is  a  solution  of  fi.  Conversely,  a  solution  of  Q 
ts  obviously  a  solution  of  II  because  the  M  and  the  cardinality  variables  of  Q  are  the 
variables  of  IT,  and  the  constraints  of  II  are  satisfied  by  any  solution  of  <2.0 

So,  a  card-matrix  constraint  can  be  filtered  by  applying  arc  consistency  to 
the  constraint  network  associated  with  it. 

7  Experiments 

In  order  to  perform  comparisons  with  other  approaches  for  which  there  are 
results  reported  in  the  literature  we  performed  our  empirical  analysis  for  the 
particular  case  of  the  cardinality  matrix  constraint  in  which  each  value  has  to 
be  assigned  at  most  once  in  each  row  and  column  (alldiff  matrix  constraint).  We 
used  hard  Latin  square  instances.  (The  benchmark  instances  are  available  from: 
http://mat.gsia.tnnu.edu/COLOR02  or  from  gomes@CS.COrnell.edu.) 

A  new  strategy  to  select  the  next  variable  and  the  value  to  branch  on  for 
Latin  square  problems  was  proposed  in  (4).  This  strategy  clearly  outperforms 
all  the  previous  ones  that  have  been  tested.  It  consists  of  selecting  the  variable 
with  the  minimum  domain  size  and  then  select  the  value  which  occurs  the  fewest 
times  in  the  domains  of  the  variables  of  the  rows  and  the  columns  of  the  selected 
variable.  Wo  will  denote  it  by  dom-lessO.  Thus  strategy  is  a  kind  of  minimum 
Conflict  strategy.  We  have  improved  this  strategy  by  breaking  the  tie  of  variables. 
When  two  variables  have  the  same  size  of  domain  we  select  t  he  one  for  which  the 
number  of  instantiated  variables  of  the  row  and  of  the  column  is  maximum.  We 
tested  several  combinations  (like  the  minimum  number  of  already  instantiated 
variables),  and  it  appears  that,  our  variant  is  the  most  robust  one.  Breaking  ties 
is  interesting,  but  the  ways  we  break  the  ties  seem  almost  equivalent.  We  will 
denote  our  new  strategy  by  dom-rnaxB-lessO. 
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2al!diff-AC 

dom-lessO 

3alldiff-AC 

dom-lessO 

2a!ldiff-GAC 

dom-lessO 

alldiff-matrix 

dom-lessO 

time 

#  fails 

time 

#fails 

time 

#  fails 

time 

#  fails 
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>  50,000 

>  50,000 
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3 
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>  50,000 
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22 
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14 
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We  present  two  sets  of  results.  The  first  one  is  a  comparison  of  our  method 
•with  the  approach  of  [-1],  the  most  competitive  CP  based  strategy.  We  will  see 
that  our  approach,  using  the  alldiff  matrix  constraint,  outperforms  the  approach 
reported  in  [4].  The  second  one  is  a  comparison  of  the  branching  strategies  when 
the  alldiff  matrix  constraint  is  used. 

The  V2all<iiff-GAC”  method  is  the  classical  model  using  2  alldiff  constraints 
associated  with  the  filtering  algorithm  establishing  arc  consistency.  The  "3alldiff- 
AC”  method  is  the  model  in  which  3  alldiff  constraints  have  been  used  but  the 
global  constraints  arc  not  used,  and  ”2alldiff-AC”  method  uses  only  2  alldiff  con¬ 
straints.  This  latter  method  has  been  used  by  [4].  All  the  experiments  have  been 
performed  on  a  Pentium  IV  M,  2Mhz  running  under  Windows  XP  Professional, 
and  HOC.  Solver  6.0.  The  code  is  available  upon  request  from  the  authors.  Thus, 
these  experiments  are  reproducible. 

These  results  clearly  show  that: 

•  difficult  instances  cannot  be  solved  without  efficient  filtering  algorithms 

•  the  alldiff-matrix  dearly  outperforms  2alkliff  models 

•  the  branching  strategy  we  propose  is  better  than  the  previous  ones. 
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Several  instances  remain  open  for  a  CP  approach:  qwh.order40.hoies528. 
qwh.orderl0.holes544.qwh.order40.holes560,  qwh.order33  holes.3Sl.bal, 
qwh.order50.holrdS2o.bal.  The  instance  qwh.order3o.holes405  is  solved  with  our 
approach  in  9,900  s  and  6,322,742  backtracks. 

8  Conclusion 

We  present  the  Cardinality  Matrix  Constraint  to  efficiently  model  cardinality 
matrix  problems.  Wc  also  propose  a  simple  filtering  algorithm  of  low  cost  to 
reduce  the  ranges  of  the  cardinality  variables.  The  cardinality  (0,l)-matrix  con¬ 
straint  is  a  particular  case  of  the  transportation  problem,  a  well-studied  network 
How  problem,  and  it  provides  a  good  representation  to  capture  the  interactions 
between  rows,  columns,  and  symbols.  We  report  results  for  the  Alldiff  Matrix 
constraint,  a  particular  case  of  the  Cardinality  Matrix  Constraint.  Our  results 
show  that  the  Alldiff  Matrix  constraint  clearly  outperforms  standard  formula¬ 
tions  of  Alldiff  Matrix  problems. 
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